AITopics | quality score

Supplementary Materials for Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

Neural Information Processing SystemsApr-29-2026, 19:21:13 GMT

The details of multiple datasets for OIQA task are presented in Table A. For the dataset that contains scanpath coordinates, we can directly sample viewport sequences from it and use our network to predict the quality scores. However, it is challenging and costly to record user scanpath data for every ODI in realistic scenarios. The scanpath information is likely unavailable when evaluating the quality of a panorama. Therefore, we propose a generalized Recursive Probability Sampling (RPS) method to generate multiple pseudo viewport sequences for the panorama, which assists the network to predict an accurate quality score in a way that is similar to the observer's actual scoring process. In JUFE and JXUFE, each ODI consists of 300 viewport coordinates, recorded using a head-mounted display (HMD).

artificial intelligence, machine learning, quality assessment, (15 more...)

Neural Information Processing Systems

Country: Asia (0.15)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.72)

Add feedback

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

Neural Information Processing SystemsApr-29-2026, 19:21:09 GMT

Blind Omnidirectional Image Quality Assessment (BOIQA) aims to objectively assess the human perceptual quality of omnidirectional images (ODIs) without relying on pristine-quality image information. It is becoming more significant with the increasing advancement of virtual reality (VR) technology. However, the quality assessment of ODIs is severely hampered by the fact that the existing BOIQA pipeline lacks the modeling of the observer's browsing process. To tackle this issue, we propose a novel multi-sequence network for BOIQA called Assessor360, which is derived from the realistic multi-assessor ODI quality assessment procedure. Specifically, we propose a generalized Recursive Probability Sampling (RPS) method for the BOIQA task, combining content and details information to generate multiple pseudo viewport sequences from a given starting point.

artificial intelligence, human computer interaction, machine learning, (20 more...)

Neural Information Processing Systems

Country: Asia (0.28)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (1.00)
Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

41da609c519d77b29be442f8c1105647-Paper.pdf

Neural Information Processing SystemsApr-25-2026, 15:12:29 GMT

artificial intelligence, machine learning, reinforcement learning, (15 more...)

Neural Information Processing Systems

Genre: Research Report (0.46)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment Tianhe Wu

Neural Information Processing SystemsFeb-17-2026, 03:59:10 GMT

Extensive experimental results demonstrate that Assessor360 outperforms state-of-the-art methods on multiple OIQA datasets.

artificial intelligence, human computer interaction, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)
North America > United States > Maryland > Prince George's County > College Park (0.04)
Europe > Netherlands > Drenthe > Assen (0.04)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)
Information Technology > Human Computer Interaction > Interfaces (0.68)

Add feedback

The Target-Charging Technique for Privacy Analysis across Interactive Computations

Neural Information Processing SystemsFeb-16-2026, 23:26:42 GMT

We propose the T arget Charging T echnique (TCT), a unified privacy analysis framework for interactive settings where a sensitive dataset is accessed multiple times using differentially private algorithms. Unlike traditional composition, where privacy guarantees deteriorate quickly with the number of accesses, TCT allows computations that don't hit a specified target, often the vast majority, to be essentially free (while incurring instead a small overhead on those that do hit their targets). TCT generalizes tools such as the sparse vector technique and top-k selection from private candidates and extends their remarkable privacy enhancement benefits from noisy Lipschitz functions to general private algorithms.

algorithm, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Nevada (0.04)
North America > Canada > British Columbia > Vancouver (0.04)
North America > United States > Rhode Island > Providence County > Providence (0.04)
(9 more...)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.68)

Add feedback

a1c716638d9b618a1a40a96f473c8250-Paper-Conference.pdf

Neural Information Processing SystemsFeb-16-2026, 06:03:11 GMT

adversarial attack, artificial intelligence, machine learning, (18 more...)

Neural Information Processing Systems

Country: Asia > China > Guangdong Province > Guangzhou (0.04)

Industry: Information Technology > Security & Privacy (0.73)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.95)
(2 more...)

Add feedback

Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare

Neural Information Processing SystemsFeb-11-2026, 07:41:23 GMT

While recent advancements in large multimodal models (LMMs) have significantly improved their abilities in image quality assessment (IQA) relying on absolute quality rating, how to transfer reliable relative quality comparison outputs to continuous perceptual quality scores remains largely unexplored.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Poland (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Add feedback

Vulnerabilities in Video Quality Assessment Models: The Challenge of Adversarial Attacks

Neural Information Processing SystemsDec-26-2025, 11:31:10 GMT

No-Reference Video Quality Assessment (NR-VQA) plays an essential role in improving the viewing experience of end-users. Driven by deep learning, recent NR-VQA models based on Convolutional Neural Networks (CNNs) and Transformers have achieved outstanding performance. To build a reliable and practical assessment system, it is of great necessity to evaluate their robustness. However, such issue has received little attention in the academic community. In this paper, we make the first attempt to evaluate the robustness of NR-VQA models againstadversarial attacks, and propose a patch-based random search method for black-box attack. Specifically, considering both the attack effect on quality score and the visual quality of adversarial video, the attack problem is formulated as misleading the estimated quality score under the constraint of just-noticeable difference (JND). Built upon such formulation, a novel loss function called Score-Reversed Boundary Loss is designed to push the adversarial video's estimated quality score far away from its ground-truth score towards a specific boundary, and the JND constraint is modeled as a strict $L_2$ and $L_\infty$ norm restriction. By this means, both white-box and black-box attacks can be launched in an effective and imperceptible manner.

name change, video quality assessment model, vulnerability, (7 more...)

Neural Information Processing Systems

Industry:

Information Technology > Security & Privacy (0.43)
Government > Military (0.43)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.59)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.59)

Add feedback

Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare

Neural Information Processing SystemsDec-25-2025, 00:48:14 GMT

While recent advancements in large multimodal models (LMMs) have significantly improved their abilities in image quality assessment (IQA) relying on absolute quality rating, how to transfer reliable relative quality comparison outputs to continuous perceptual quality scores remains largely unexplored. To address this gap, we introduce an all-around LMM-based NR-IQA model, which is capable of producing qualitatively comparative responses and effectively translating these discrete comparison outcomes into a continuous quality score. Specifically, during training, we present to generate scaled-up comparative instructions by comparing images from the same IQA dataset, allowing for more flexible integration of diverse IQA datasets. Utilizing the established large-scale training corpus, we develop a human-like visual quality comparator. During inference, moving beyond binary choices, we propose a soft comparison method that calculates the likelihood of the test image being preferred over multiple predefined anchor images. The quality score is further optimized by maximum a posteriori estimation with the resulting probability matrix. Extensive experiments on nine IQA datasets validate that the Compare2Score effectively bridges text-defined comparative levels during training with converted single image quality scores for inference, surpassing state-of-the-art IQA models across diverse scenarios. Moreover, we verify that the probability-matrix-based inference conversion not only improves the rating accuracy of Compare2Score but also zero-shot general-purpose LMMs, suggesting its intrinsic effectiveness.

adaptive image quality assessment, artificial intelligence, machine learning, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Add feedback

Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining

Fan, Dongyang, Hashemi, Diba, Karimireddy, Sai Praneeth, Jaggi, Martin

arXiv.org Artificial IntelligenceNov-27-2025

Incorporating metadata in Large Language Models (LLMs) pretraining has recently emerged as a promising approach to accelerate training. However prior work highlighted only one useful signal-URLs, leaving open the question of whether other forms of metadata could yield greater benefits. In this study, we investigate a wider range of metadata types and find other types of metadata, such as fine-grained indicators of document quality that can also accelerate pretraining when prepended. We identify a common feature among effective metadata: they encode information at a finer granularity. We further introduce metadata appending as a means of improving training efficiency, where predicting an appropriate metadata as auxiliary task can help speed up pretraining. In addition, learnable meta-tokens trained with masked loss can recover part of the speedup by inducing quality-aware latent structure. Using probing, we analyze latent representations to understand how metadata shapes learning. Together, these results yield practical guidelines for integrating metadata to improve both the efficiency and effectiveness of LLM pretraining.

artificial intelligence, large language model, natural language, (18 more...)

arXiv.org Artificial Intelligence

2511.21613

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.48)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Filters

Collaborating Authors

quality score

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Supplementary Materials for Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment

41da609c519d77b29be442f8c1105647-Paper.pdf

Assessor360: Multi-sequence Network for Blind Omnidirectional Image Quality Assessment Tianhe Wu

The Target-Charging Technique for Privacy Analysis across Interactive Computations

a1c716638d9b618a1a40a96f473c8250-Paper-Conference.pdf

Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare

Vulnerabilities in Video Quality Assessment Models: The Challenge of Adversarial Attacks

Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare

Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining